3D shape template generation from RGB-D images capturing a moving and deforming object
نویسندگان
چکیده
Automatically reconstructing a 3D shape model of a nonrigid object using a sequence from a single commodity RGB-D sensor is a challenging problem. Some techniques use a 3D shape template of a target object; however, in order to generate the template automatically, the target object required to be stationary. Otherwise, a non-rigid ICP algorithm, which registers a pair of point clouds, can be used for reconstructing 3D geometry of a non-rigid object directly, but it often fails due to the ambiguity in point correspondences. This paper presents a method for generating a 3D shape template from a single RGB-D sequence. In order to reduce the ambiguity in point correspondences, our method leverages point trajectories obtained in the RGB images, which can be used for associating points in different point clouds. We demonstrate the capability of our method using deforming human bodies. Introduction Recently, various applications that present a moving and deforming object to users, such as virtual fitting room [1] and virtual pets [2], have become available to ordinary users. These applications render objects based on 3D geometry of their entire shapes (which we refer to as full-body shape models) and their non-rigid motion, both of which are usually handcrafted. Automatic techniques for reconstructing full-body shape models at each frame can drastically reduce the cost for creating a 3D shape model and motion (e.g., [3, 4]). They use multiple sensors (e.g., RGB or RGB-D sensors), whose relative poses are known, to capture the object from different viewpoints simultaneously. It then applies an existing 3D reconstruction technique for rigid objects, such as [5, 6, 7]. However, the use of multiple sensors may be still cumbersome for some applications in which ordinary users need their own shape models and motions. Reconstructing 3D shape and motion from a single sensor is a challenging problem. Two approaches have been proposed: one uses 3D shape templates of the target object and the other does not. Former approach [8, 9] generates a 3D shape template using a 3D shape reconstruction technique for rigid objects [5, 6, 7], assuming the object is almost stationary. They then fit the 3D shape template to a 3D point cloud at each frame of a single depth map sequence. One major limitation of this approach is that it requires an extra burden to capture the target object while it is stationary, which is practically infeasible, especially for objects like animals. Latter approach [10, 11] registers 3D point clouds in all frame of a single depth map sequence to any other frames using non-rigid iterative closest point (ICP) [12, 13]; however, it often 3D shape template
منابع مشابه
Generating a 3D shape template of a moving and deforming object from an RGB-D image sequence
Automatically reconstructing a 3D shape model of a nonrigid object using a sequence from a single commodity RGB-D sensor is a challenging problem. Some techniques use a 3D shape template of a target object; however, in order to generate the template automatically, the target object required to be stationary. Otherwise, a non-rigid ICP algorithm, which registers a pair of point clouds, can be us...
متن کاملمدلسازی صفحهای محیطهای داخلی با استفاده از تصاویر RGB-D
In robotic applications and especially 3D map generation of indoor environments, analyzing RGB-D images have become a key problem. The mapping problem is one of the most important problems in creating autonomous mobile robots. Autonomous mobile robots are used in mine excavation, rescue missions in collapsed buildings and even planets’ exploration. Furthermore, indoor mapping is beneficial in f...
متن کامل6-DOF Model Based Tracking via Object Coordinate Regression
This work investigates the problem of 6-Degrees-Of-Freedom (6-DOF) object tracking from RGB-D images, where the object is rigid and a 3D model of the object is known. As in many previous works, we utilize a Particle Filter (PF) framework. In order to have a fast tracker, the key aspect is to design a clever proposal distribution which works reliably even with a small number of particles. To ach...
متن کاملDeforming Motion , Shape Average and the Joint Registration and Segmentation of Images ∗
What does it mean for a deforming object to be “moving”? How can we separate the overall motion (a finite-dimensional group action) from the more general deformation (a diffeomorphism)? In this paper we propose a definition of motion for a deforming object and introduce a notion of “shape average” as the entity that separates the motion from the deformation. Our definition allows us to derive n...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کامل